Performance Impacts with Reliable Parallel File Systems at Exascale Level
نویسندگان
چکیده
The introduction of Exascale storage into production systems will lead to an increase on the number of storage servers needed by parallel file systems. In this scenario, parallel file system designers should move from the current replication configurations to the more space and energy efficient erasure-coded configurations between storage servers. Unfortunately, the current trends on energy efficiency are directed to creating less powerful clients, but a larger number of them (light-weight Exascale nodes), increasing the frequency of write requests and therefore creating more parity update requests. In this paper, we investigate RAID-5 and RAID-6 parity-based reliability organizations in Exascale storage systems. We propose two software mechanisms to improve the performance of write requests. The first mechanism reduces the number of operations to update a parity block, improving the performance of writes up to 200%. The second mechanism allows applications to notify when reliability is needed by the data, delaying the parity calculation and improving the performance up to a 300%. Using our proposals, traditional replication schemes can be replaced by reliability models like RAID-5 or RAID-6 without the expected performance loss.
منابع مشابه
U.S. Department of Energy Best Practices Workshop on File Systems & Archives: Usability at Los Alamos National Lab
There yet exist no truly parallel file systems. Those that make the claim fall short when it comes to providing adequate concurrent write performance at large scale. This limitation causes large usability headaches in HPC computing. Users need two major capabilities missing from current parallel file systems. One, they need low latency interactivity. Two, they need high bandwidth for large para...
متن کاملSimHEC: Understanding Application Efficiency at Exascales through Simulations
It is expected that our HEC system will enter exascale era in decade, which is one thousand times of performance as today’s system (petascale). In the mean time, many challenges also have been noticed and pointed out, as the size of HEC system increased without some dispensable improving on architecture of today’s HEC system, the systems could collapse at exascale, because the functionality wou...
متن کاملESSEX: Equipping Sparse Solvers for Exascale
The ESSEX project investigates computational issues arising at exascale for large-scale sparse eigenvalue problems and develops programming concepts and numerical methods for their solution. The project pursues a coherent co-design of all software layers where a holistic performance engineering process guides code development across the classic boundaries of application, numerical method, and b...
متن کاملPerformance at Exascale
Introduction Exascale systems will consist of complex configurations with a huge number of potentially heterogeneous components. Deep software hierarchies of large, complex software components will be required to make use of such systems. While the software layers are designed to be transparent, they are typically not transparent with respect to performance. This performance intransparency wil...
متن کاملExploring Data Compression in Distributed File Systems
The ever-growing imbalance between computation and I/O is one of the fundamental challenges for current petascale and future exascale systems. As these large systems become more compute heavy by design, it is becoming harder to support dataintensive applications with adequate performance. To narrow the gap between computation and I/O, parallel and distributed file systems have been introduced. ...
متن کامل